@christian-oudard

Summary

There is a bug in crwl --deep-crawl: in markdown format, it outputs only the first crawled page.
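
To illustrate the kind of bug described above, here is a minimal sketch, not the actual change in crawl4ai/cli.py. It assumes a deep crawl yields a list of results while a regular crawl yields a single one; `PageResult` and `render_markdown` are hypothetical names introduced only for this example.

```python
from dataclasses import dataclass
from typing import Iterable, Union


@dataclass
class PageResult:
    """Hypothetical stand-in for one crawled page (not a crawl4ai class)."""
    url: str
    markdown: str


def render_markdown(results: Union[PageResult, Iterable[PageResult]]) -> str:
    """Render every crawled page as markdown, not only the first one."""
    # A regular crawl is assumed to produce one result; normalize it to a list
    # so that single-page and deep-crawl output share the same code path.
    if isinstance(results, PageResult):
        results = [results]
    # Emit a section per page, separated by blank lines.
    return "\n\n".join(f"# {r.url}\n\n{r.markdown}" for r in results)
```

Normalizing to a list first means the output path never has to special-case the first element, which is where a bug of this kind tends to come from.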

List of files changed and why

  • crawl4ai/cli.py, the fix
  • tests/cli/test_cli.py, the tests for it

How Has This Been Tested?

Manual testing on various websites, plus unit tests.

Checklist:

  • My code follows the style guidelines of this project
  • I have performed a self-review of my own code
  • I have commented my code, particularly in hard-to-understand areas
  • I have made corresponding changes to the documentation
  • I have added/updated unit tests that prove my fix is effective or that my feature works
  • New and existing unit tests pass locally with my changes

@vikas-gits-good

I have opened a PR for a similar issue here.

This pull request only addresses the CLI part. My fix should solve both.

@unclecode unclecode changed the base branch from main to develop February 1, 2026 06:21
@unclecode unclecode merged commit 37995d4 into unclecode:develop Feb 1, 2026
@unclecode
Owner

Merged into develop — thanks for the fix and the thorough tests. Will be in the next release, and we'll add you to CONTRIBUTORS.md.

unclecode added a commit that referenced this pull request Feb 1, 2026
- PR #1667: Fix deep-crawl CLI outputting only the first page
- PR #1296: Fix VersionManager ignoring CRAWL4_AI_BASE_DIRECTORY
- PR #1364: Fix script tag removal losing adjacent text
- Fix: restore .crawl4ai subfolder in VersionManager path
- Close #1150 (already fixed on develop)
- Update CONTRIBUTORS.md and PR-TODOLIST.md